NLP & DBpedia An Upward Knowledge Acquisition Spiral
نویسندگان
چکیده
Recently, the DBpedia community has experienced an immense increase in activity and we believe, that the time has come to explore the connection between DBpedia & Natural Language Processing (NLP) in a yet unprecedented depth. DBpedia has a long-standing tradition to provide useful data as well as a commitment to reliable Semantic Web technologies and living best practices. As the extraction of the Wikipedia’s infoboxes by DBpedia matures, we can shift our focus to new challenges such as extracting information from an unstructured article text as well as becoming a testing ground for multilingual NLP methods. DBpedia has the potential to create an upward knowledge acquisition spiral as it provides a small amount of general knowledge allowing to process text, derive more knowledge, validate this knowledge and improve text processing methods. The goal of this workshop was to present existing research, systems and resources, but also to allow discussion about different points of convergence and divergence of the NLP and DBpedia community with a special focus on challenges that lie ahead. We would like to take part in the debate on how to use DBpedia for NLP and NLP for DBpedia.
منابع مشابه
Extending DBpedia with Wikipedia List Pages
Thanks to its wide coverage and general-purpose ontology, DBpedia is a prominent dataset in the Linked Open Data cloud. DBpedia’s content is harvested from Wikipedia’s infoboxes, based on manually created mappings. In this paper, we explore the use of a promising source of knowledge for extending DBpedia, i.e., Wikipedia’s list pages. We discuss how a combination of frequent pattern mining and ...
متن کاملDBlexipedia: A Nucleus for a Multilingual Lexical Semantic Web
A huge amount of datasets on the Semantic Web are linked to a few datahubs, the most prominent of which is DBpedia. What makes the exploitation of DBpedia challenging for natural language-based applications, however, is that such NLP applications require knowledge about how the ontology elements are verbalized in natural language. In order to provide such knowledge at the required scale and the...
متن کاملThe German DBpedia: A Sense Repository for Linking Entities
The modeling of lexico-semantic resources by means of ontologies is an established practice. Similarly, general-purpose knowledge bases are available, e.g. DBpedia, the nucleus for the Web of Data. In this section, we provide a brief introduction to DBpedia and describe recent internationalization efforts (including the creation of a German version) around it. With DBpedia serving as an entity ...
متن کاملMissing Mr. Brown and Buying an Abraham Lincoln - Dark Entities and DBpedia
We argue for the need for the community to address the issue of “dark entities”, those domain entities for which a knowledge base has no information in the context of the entity linking task for building Event-Centric Knowledge Graphs. Through an analysis of a large (1,2 million article) automotive newswire corpus against DBpedia, we identify six classes of errors that lead to dark entities. Fi...
متن کاملDBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia
The DBpedia community project extracts structured, multilingual knowledge from Wikipedia and makes it freely available using Semantic Web and Linked Data standards. The extracted knowledge, comprising more than 1.8 billion facts, is structured according to an ontology maintained by the community. The knowledge is obtained from different Wikipedia language editions, thus covering more than 100 l...
متن کامل